SemanticScuttle - klotz.me » klotz: fine tuning

klotz: fine tuning*

Bookmarks on this page are managed by an admin user.

Tuning Language Models by Proxy This bookmark is certified by an admin user.

Introduces proxy-tuning, a lightweight decoding-time algorithm that operates on top of black-box LMs to achieve the same end as direct tuning. The method tunes a smaller LM, then applies the difference between the predictions of the small tuned and untuned LMs to shift the original predictions of the larger untuned model in the direction of tuning, while retaining the benefits of larger-scale pretraining.

2024-05-11 Tags: proxy, fine tuning, llm, llama2-70b by klotz

mistral-doc: Fine-tuning an LLM on my ChatGPT conversations - Duarte O.Carmo This bookmark is certified by an admin user.

2024-04-14 Tags: llm, mistral, fine tuning by klotz

Lightning-AI's LitGPT Tutorial This bookmark is certified by an admin user.

- GitHub repository for a tutorial series called "0 to LitGPT."
- Provides an overview of how to get started with LitGPT, which is an open-source implementation of GPT-3.
- Offers various resources such as codes, issues, pull requests, actions, security features, insights, and more related to the LitGPT project.

2024-03-28 Tags: llm, fine tuning, github, litegpt, tutorial by klotz

understanding-using-and-finetuning-gemma This bookmark is certified by an admin user.

2024-02-24 Tags: llm, gemma, fine tuning, google by klotz

Mistral Fine Tuning with QLora This bookmark is certified by an admin user.

2024-02-22 Tags: mistral, fine tuning, llm, qlora by klotz

Fine Tuning LLM on RTX 3090 This bookmark is certified by an admin user.

- Discusses the use of consumer graphics cards for fine-tuning large language models (LLMs)
- Compares consumer graphics cards, such as NVIDIA GeForce RTX Series GPUs, to data center and cloud computing GPUs
- Highlights the differences in GPU memory and price between consumer and data center GPUs
- Shares the author's experience using a GeForce 3090 RTX card with 24GB of GPU memory for fine-tuning LLMs

2024-02-02 Tags: llm, fine tuning, nvidia, rtx, 3090, self-hosted by klotz

GitHub unsloth: 5X faster 60% less memory QLoRA finetuning This bookmark is certified by an admin user.

2024-01-29 Tags: llm, qlora, fine tuning, github, quantization by klotz

How to fine-tune an open-source LLaMa using QLoRa This bookmark is certified by an admin user.

2024-01-28 Tags: llm, qlora, llama, fine tuning, quantization by klotz

Preference Tuning LLMs with Direct Preference Optimization Methods This bookmark is certified by an admin user.

2024-01-18 Tags: llm, dpo, fine tuning, huggingface by klotz

Supervised Fine Tuning This bookmark is certified by an admin user.

2024-01-17 Tags: sft, llm, fine tuning by klotz

First / Previous / Next / Last / Page 1 of 0